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Memo 44 — 



A Simple Direct Proof of Post's Normal Form Theorem 

by Marvin Minsky 



The theorem proved in this note is the Normal Form Theorem proved 
in Post's 1943 paper, "Formal Reductions of the General Combinatorial 
Decision Problem". We have long felt that this result is one of the 
most beautiful in mathematics. The fact that any formal systems can 
be reduced to Post canonical systems with a single axiom and produc- 
tions of the restricted form 

00! -*■ OLT 

is in itself a remarkable discovery, and even more so when we learn 
that this was found in 1921, long before the formalization of meta- 
mathematics became so popular. Post's proof of the theorem is quite 
readable. Our proof is somewhat simpler, and quite a bit shorter. 
Its brevity is a result of doing the necessary operations with less 
concern about the order in which they are done. Some clarity is 
gained because this technique requires fewer auxiliary symbols. 
Some clarity is lost because of the superposition of operations and, 
accordingly, we resort to illustrating the process by a detailed 
example (which brings the writing back to roughly its original length) 
Still, the techniques used here are somewhat different and give, per- 
haps, a better feeling for why the theorem is true. 

The paper conclude;.; with a modest noni:rlbu.tion--a very simple 
"Universal Canonical Extension" baaed on the same techniques. V.p 
end up with a "Universal Canonical Extension" with three lott'---; 
(minima'.! in a sense) and four productions (minimal '; ). 



A Simple Direct Proof of Post's Normal Form Theorem 

We first prove the theorem for systems having single-antecedent 

productions. Suppose that we have a Post Canonical System P for which 

a typical production has the form 

cr o?_ rT-Ou . . . ot (r -* r a. T-.O' . . . at. r (ir) 

v oll2 n n o 1 1 1 i i m 

^ ^ • y^i_ ^ 2 ™ 

antecedent , consequent 

where each variable string a. is one of the a, and cr , . . . , * and 

r , . .., t are given fixed strings. This production will apply to any 
string which has the form of the antecedent — that is, any string which 
begins with a , contains ncn- overlapping occurences of cr.., cr^, ... cr . 

(in that order) and ends with cr . We will construct a new Normal system, 
an extension P* of P, by replacing each production f by a set of new pro- 
ductions using some new symbols. The task for the system P* is (i) to 
verify that a string S can be so analysed as an antecedent of ir and then 
(ii) to assemble a consequent string composed of the r-strings with the 
proper strings for a. , ..., a. inserted in the correct positions. The 
problem j 
the form 



problem is to do all this using m only normal productions—productions of 



The difficulty is that normal productions can "look" only at the beginning 
of a string* they cannot examine a string's interior to see, e. g., if 
it contains an occurrence of cr... 

We can evade this limitation by "rotating" the string so that suc- 
cessive symbols are brought up to the beginning. The following produc- 
tions will thus test whether, in an initial string To?, the unknown 
string a has the required antecedent form. Suppose the letters of the 
original alphabet are a.-., a 2 , ..., a . We introduce new letters 1\,T 2 , 
..., T and two more, Z and Y. (We use upper-case letters for all new 
symbols used in the extension P*. ) 
T ao a — aZT x 

a .a -* oa. (j = 1, ♦ . ., r) 

T ± a .<* -+ aa .T ± (j = l,...,r i = l,...,n) 

Vl a " tf i + 1 (i « l,...,n-l) 



2. 



ZOi -+ CtZ 
T n a n Za 



<xY 



To see how this works, we study an example. Suppose that tr is 



a~a 
that is, 


lV' 


_a a, a 
D 2 1 


3 a 3 a 4 - a 2 


a l a 3 a 2 a 2 a 3 a l a 5' 


a o = a 3 r - a 2ai o^ =<* 3 n = 3 


a l = a 2 a 5 T l = a 2 a 2 %= a 3 m = 2 


a 2 = a l a 3 T 2 = a l a 5 


er 3 - a 4 


Suppose 


that 


the actual string S 


is, in fact, 


a 3 


a l a 2 


a 2 a 5 


a 4 a 2l a l a ^ 


a 2 a l 


a 4> 



which is in the form of the antecedent of tt. Then the following strings 
are produced: 

x a ja^ a^a^a j~a J. a^a^ a^a/^a^ a j. 

a-|p,QaQara|iaASl"j aQartO-j ahti X-i 

I r2 a 2 a 5 a 4 # # * 
I I apa^-a^, . • • 



a l a 4 Z T l a l 



a«i a u/j x-» a** a^ 



cj x -» a»i a^a^a p-a ^ a^a-i a^a^a-i ah 
T l a l a 2 a 2 a 4 Z 



a 2 a 2 <H Z a i T l 



l a 2 a 2 a 5 a, 4 < 



a^ar-a 



2 a 5 a 4 4 



ail Li 0,-y 

&.U& a«i a^x * 



• in 3,Qar9-|i • • • • • Siu/j a-i cIa* 



Now there is a choice, for the system can apply either 



T l a 2 a "* aa 2 T l 



or 



T 1 a 2 a 5 « 



aT^ 



Thus two strings are produced; 
a^a^. . 



and 



• CLUcj a~i dLr\Cir\ X -i 

• &h/j a -» a q x q ♦ 



3- 



The first string is ultimately doomed, because there will never again 
be an opportunity to eliminate the symbol T-, from it — so it can never 
yield a string in the original alphabet of P. But the second string 
can continue on to 

O ii P 1 "-Q^p 1 il 1 O 

&pE-j Sa&p&i E|i6 a^apa||ip 



T 2 a 2 a l a 5 a 2 a l a 4 Z a l a 2 a 4 



1 pd^-i a^apa-i s.uu a-i apa ^ ap ♦ 



Again there is a choice, producing 
and 



s^SaS--! &|i ^ a«i 8»p&h &p&"| -*"p 



a 2 a 1 a if Z a^a^a^ 

Again the first string is doomed, since Tp will never be eliminated, 
but the second string leads to 
i^EacIi EhD a«j s»qBlu ap 



i^a-i b»ulj a«i apa^apap 

a 4 Z a l a 2 a 4 a 2 a 2 a l T 3 



"3 ii 1 Q'pQ'jh ^p^o^l 

a l a 2 a 4 a 2 a 2 a l Y 

A string containing a Y can be produced only if the original 
string had the required antecedent form! This shows how we can com- 
pound normal productions to analyse into the interior of a string. 

We have shown how normal productions can be used to determine 
whether or not a string has the antecedent form* But we cannot use 
our result to construct the consequent , because the resulting string 
(containing 'Y 1 ) has the a-strings "run together". That i$ we have 
destroyed information needed to show how the string 
a-i a^apai^ a^a-i 

is composed of three strings a,, o< , and a . To correct this we present 



4. 



a revised (and final) version of the productions above, 
Productions for the Normal Extension 

The complete method requires a new set of symbols 



A 



1 

1' 

2 



A 



1 
2' 

2 



A. 



A n A n 
1* 2* 



,n 



These are used to keep track of the contents of the strings a^, or, . . 



9 n 



The new productions are 



Trr a -* a Z T n 
o 1 

Z a -* a Z 

Y a -+ a Y 

* aa 



a j a 



j 



Vi r<ff w 
T i a j°^ aA j T r- 



A ' a 



-*• rvA 



OfA, 



T a ZortxQ. 

n n 



(j = 1, . 

(1=1, • 

(1 = 1, . 

(all l, j) 



.,n-l) 

. ,n-l; j=l, . • .,r) 



a, = a-,a 2 , ff 2 = a 4 a p» an( * a -z = a p a i* 



The only change concerns the symbols A,. If the reader retraces the 
example above he will find that the final result will be 

Q aJa^A^ 

so that now, by inspecting the superscripts, one can tell that 

All the required information has 
been preserved, so we can go on to construct the consequent. 

Before going on, observe that if the original string can be resolved 
into antecedent form in several ways we will get a distinct Q-string for 
each. Thus if we start with the string 

in our previous example, we can analyse it either as 



a 



or as 







'0 



a 2 a 2 a 5| °2 



<7. 



a l a 2 a 5 a 2 



rr 



lQ": 



'3' 



Therefore the system yields the two Y-strings 



and 



1 2 ? P ^ 



In the latter, a 2 is & null-string, which is perfectly all right. 

Now we construct the consequent* We will use the letter Q to 
initiate the process by setting-up the consequent form into which the 

-strings are to be copied, 
the places corresponding to the variables a-., 
The conse quent form is introduced by the prod uction 



•1 y* 

We will use new letters V ,...,V to mark 



, a in the consequent. 



Qa - otry i T 1 V 12 r 2 



V x m r Z Y, 

v 'm 9 



where T n , ...,T m are the constant strings of the consequent of ir. A set 



of produc 



felons 



A J a k* "* "V j 



allow the A's to move freely to the right, passing over letters in 
the original alphabet. -We do not allow the A's to pass one another, 
hence their order is always preserved. We also allow the A's to pass 
over some of the V 1 s: 



A^a -+ avVj-l (if l£ k) 



Hence the A ! s ignore V's with different superscripts. But when the 
superscr ipts agree, we use 



aVo? -♦ tfV a, A 



r 

.a ♦ 

J J 



Hence, whenever an A. passes over a V with the same superscript — 
corresponding to the same variable string a. — it leaves there a copy of 
that letter a. which caused that A-symbol to appear. The result is 
that when all A's have passed over all V's, each V will be followed by 
a copy of the corresponding arstring. To see this in our example, 

Q k\k\k\k\k\k\ -+ 

A ^2 A 4 A 2 A l A l a 2 a l v3a 2 a 2 v3a l% Z Y 

5 

.... 

A^A2A^A^A3a 2 a 1 V 3 a 1 a 2 a 2 V 3 a 1 a 1 a 5 A^Z Y 
a 2 a l v3a 2 a l a 2 a 2 2 a l a l a 5 A l A 2 A 4 A 2 A 2 A l Z Y 



and we see that a copy of a_ = a,a 2 has been inserted to the right of 
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each occurrence of V J = V ^ = V 2 # 

When the string reaches this form, all the work is done and it 
remains o nly to elimina te the auxiliary letters of P*. The productions 



A^Za -► <*Z 



remove the A's, while the productions 



Ya .ct -+ o?a ,Y 
Y V 1 ^ olY 



erase all the V f s. Now all these events can occur in many different 
orders. However the net effect is always the same, once Y arrives at 
the immediate left of Z. For then all A«s have been erased and also all 
Vs. When this happens we apply, finally, 



YZa -* aT 



yielding the ultimate desired result 

o i x 1 i 2 i m m 

Observe that Y cannot pass over an h\. Therefore, if Y is to move over 
to Z it must "push" the A's ahead of it, and hence all A's must even- 
tually pass over all V's. Y erases V's when it comes to them, but 
clearly it cannot erase a V until it gets to it^and to get there it must 
have pushed all A's past that point. Thus no V is erased until it has 
thus done its work of copying. 

Problem: Prove the theorem using productions a®** otr in which neither 
cr nor r have more than two, and not both have two, letters. Do not 
hesitate to use great quantities of letters. 
Completing the proof 

There remain a few loose ends. Observe first that we have not 
really produced a legitimate extension P* of P, because P* does not 
produce any strings which have only letters of P. Proof of this 
statement: The axioms of P* all contain the upper-case letter T. Every 
production which involves an upper-case letter in its antecedent constant 
string produces at least one such letter in its consequent string. There- 
fore, (by induction) each produced string contains an upper-case letter. 

Obviously, the strings we wish to "detach" are those which start 
with 'TV In fact any string Ta can be interpreted as an assertion 
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that a is a Theorem of the system P. It would be tempting simply to 
introduce a production 

Ta -► Of 
to remove an initial T. The trouble is that this would lead to 
spurious strings in the P- alphabet, because of the productions 

which are already part of the system. This shows, in fact, that there is 
no way to obtain the desired result in our system, because we will 
always get "rotated 11 strings with legitimate strings. The cure requires 
a new alphabet. Suppose that the alphabet of P is really b 1 ,.. .,b r 
and that a-,,..*, a are really new letters of P*. Then we can "detach" 
1 T 1 by the productions 



Ta j a 


— 


aRb j 


a f 


— ► 


ab J 


R a 


"♦ 


OL 



The trick depends on the fact that we do not allow the b f s to rotate 
(there are no productions b.a-*ort>,.) 



Of 



course these productions make it possible for an a . to be con- 



verted to a b, at any step of the process, but when this happens 
prematurely it only yields "doomed" strings which are not pure strings 
in P and cannot yield any of same, once the f b f letter rotates to the 
front of the string. Only in the case that the conversion begins with 

'Ta. 1 , so that the resulting string comes to begin with «R f , can the 
(otherwise) always-present upper-case letter be removed, for 

Ra -* a. 
is the only production of the entire system that can eliminate the 
auxiliary upper-case letter. 

Another loose end is involved in the case of systems with more 
than one production. Here we simply carry out the whole above construc- 
tion for each production, using entirely new letters for all upper- 
case letters except T. Then the productions can operate independently, 
only linked by T so that each system can operate on the final results of 
the operation of other systems. 

We need one more theorem to complete our proof that the Canonical 
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systems of Post, in their full generality, can be replaced by Normal 
Extensions. We have to account for the case of productions which 
require several, rather than just one, strings in the antecedent. The 
proof of this, below, shows that this most general system can be reduced 
(in the form of a Canonical Extension) to the case of the single- ante- 
cedent production. Having shown this, the result of the present section 
completes the proof of the general theorem. We feel that the method of 
the following proof is instructive in itself, if only in showing how 
it may be useful to employ, for theoretical use only, a kind of reck- 
less extravagance. 

Reduction of multiple-antecedent productions to single-antecedent 
productions 

We consider now the most general form of Post's Canonical systems, 
in which a production may require more than one existing string in order 
to produce a new string. A most classical example of this occurs in 
elementary logic where from A and A=^B one produces B. In the most 
general system P a production has the form 



(r 10 a ll cr ll a 12 • • * 'ln^ 
CT 20 0? 21 CT 21 0f 22 " # ' ^ngi 
a S0 a Sl a Sl Of S2 • • * ^Sng 



~* T 0*1 T 1*2 • • • T r 



M 



where the cr's and t's are given constant strings and each e* k is one 
of the <* 's. 

Below we show how to make a canonical extension P* of P which has 
only single-antecedent productions. Let a,,...,a be the alphabet of 
P. Let <!>,...,$ be the strings which are the Axioms of P. Our new 
system will have but one axiom: let ! A f be a new symbol. 
AAfcjAAOgAA . . . AA* AA 



(There would be little point to having several axioms now, since with 
single-antecedent productions there could be no combining of their sep- 
arate consequences.) Corresponding to the old production ir we introduce 
one new symbol X and the following monstruous production 7r*: 
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aS ^10*11*11*12 



A(T S0 a Sl (T Sl a S2 



A V a io 



# • • XXyfljpto^ ]_ a 1 2 



a i ni a 21 



"ln/V^o' 


*21 CT 2l a 22 


... aS 2 


A ... 


' CT Sn s A8 S A 






1 
1 


• « • 




• # • 




^r^V^ 


a l T i a 2 


• • * 




•" a 2n 2 ••' 


a si ••* 


°Sn s X 





(it*) 



This has the effect of analysing a string to see if it contains sub- 
strings which have the forms required of the antecedents. If it finds 
such strings, sandwiched between A's, and if they occur in the given 
order—an undesirable requirement to be lifted shortly— it adjoins the 
resulting consequent, and some other material. 

Why cannot we simply adjoin the consequent? The reason is that 
the analysis may be spurious, in that one of the <*j/ s can contain too 
much — parts from more than one proper P-string. This will be the case 
if any a ± . contains one or more ! A f s in its interior. This is why our 
production 7r* finally appends all the discovered variable strings in 
the form 



*11 



a sn s X ' 



Note that the form does not end now with an A, so that ?r* can not apply « 
We now check to see if this appendage contains no A f s: 

(j = l,...,r) 



a,Xa .e* 2 X -* o^XttgX 
a- XX -* a-AA 



Then the string will return to standard form only if there were no A f s 
in any of the selected a. .'s. 

The result is that as new strings are produced, in accord with the 
productions of the original system P, they are appended (between f AA's) 
to the axiom string. Any assertion of P is ultimately so generated, 
and appears preceded by its entire "production history". 

An extension system is required to yield the original strings with- 
out any auxiliary letters. We do this by a similar method: 
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Aa,Aa 2 Aa-A -*• Ya 2 Y 



a l Ya j a 2 Y 



a l a j YOf 2 Y 



gYY 



o> 



(j = l,...,r) 



The result is that any P-string which occurs between A's In a 
standard form P*-string will be released without auxiliary letters. 

One loose end. The production v* requires the antecedents of w 
to occur in a given order. There is no meaning to such a restriction 
in P. To eliminate it we could introduce (rather than just ir*) a distinct 
production like ir* for each possible order of the antecedents-- that is, 
we need Si forms of ir* with the antecedent parts permuted. This would 
complete the construction. A more elegant solution: we adjoin the 
single production 



Aa,AAa AAa A -► Ao^AAagAAa.AAagAA. 



This makes it possible to take any interior P- string and append it to 
the end of the compound P*-string. Thus the required antecedents can 
be placed at the end in any order required by a ir* production. 
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Part II 
A "universal" canonical extension 

It Is very easy to construct an analogue of a Universal Turing 
machine in the form of a canonical system. Let P be an arbitrary Normal 
system with axioms ^ , . . . , <l s --and productions tr^cr* otr^. Let a 1 ,...,a r 
be the alphabet of P, and let A, C, S, and T be new letters. Our 
system ^tfpWill have the single axiom 

ACAcT 1 CT 1 Afr 2 CT 2 . . . A<r Ci^ASfcjS^ . . .S$ S STTT. 

Observe that this axiom is a complete description of P, since from it 
one can reconstruct all the axioms and productions of P. Our productions 
are:* 



"l 


a. Aa<J TAa 2 t>pai>o_ ' 


■ 2 J 


a^Ta^Ta .aJIotj. "* a,' 


^ 


a,TTTa 2 "* a-, TTT 


u 4 


c,TTTa 2 -► « 2 



* 'rcy 'rot}. 



(j = l,...,r) 



We assert that this system is a canonical extension of P! The 
remarkable aspect of this is that the productions ofVp do not depend 
in any way on the structure of P (except for the restriction on the 
alphabet, which we will dispose of shortly). Thus^J p is like a 
Universal Turing machine in that, given (as its axiom) a description 
of an arbitrary canonical system P, the productions of XT will generate 
precisely the theorems of P. 

To see that °ifplQ a canonical extension of P, let us examine the 
effect of tt 1 . ir 1 "looks" for a substring ActCtA and a substring S|3aS 
and "attempts" to adjoin the string otr in accord with some production 
of P 

a. a -* or. . 

i i 

This operation is valid only if (l) p = a and (2) if the strings a, 0, 
a, t do not contain upper-case letters. For if a string has the form 

* Here, all Greek letters represent variables. 
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ActCtA with a and r having no upper-case letters, then rrandrmust be 

some a. and r. . The productions w check both conditions, for they 
i i d . 

remove, one at a time, letters from J aocr and P<xt, but only if the first 
letter of each is at each step the same. Both strings can vanish only 
if <t = 0. Then production ?r~ can apply, adjoining the produced string 
cr, as though it were a new axiom of P. Production ir^ detaches the 
new string from all upper-case letters, to yield a legitimate theorem of 
P. 

We put the initial 'AC 1 in tt, so that the system would be sure to 
produce the axioms of P, in accord with the production 

a -* at 
which may not be included among the productions of P. 

Now for the alphabet problem. We simply encode P f s alphabet a 1# . 
. .,a into a two-letter alphabet a, b: simply let 'a* be represented 
by a 'b 1 followed by l a l s. Then everything works as before, and we 
need only two productions 7r 2a and w 2h forf. So *V ends up with just 
five productions. It no longer is an extension of P, but it is (with 
encoded axiom of P) an extension of a trivial re-coding of P. 
Problem 1 . Three letters is minimal for a Universal Canonical Extension. 
(Why?) But we can construct a version of V with only four productions. 
Hint: A simple change in w 1 makes ir^ unnecessary. Solution on next 

page . 

I can't see how less than four productions could suffice, for we 
need one to "do the work", two to check the alphabet conditions, and one 
to "release" the pure string. Such reasoning, however, usually turns 
out to be unsound I 

Problem 2 . Construct a version of with only one extension letter, 
still using only four productions. (Easy) 

Problem 3 . Construct a version of V using only normal productions. 
(Laborious . ) 

Problem 4 . Using the analogy with a Universal Turing machine, construct 
an unsolvable decision problem fori/. Consider the prospect of develop- 
ing the theory of computability on this basis, rather than on the Turing 
basis. 



Solution to problem It 

Don't Read Unless 

You 

Olvo Up 

on 
Problem 1. 
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o~AovrM^3Ba» 


*2* : 


%**% ^wJB^W^^% ^■^B^^W^J IBJ^^*I 


T 2b : 


Qti| XoO^X(p«KM3Hb| 


** s 
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